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Abstract: Better efficiency of the air transport system of a country at the national level, especially in terms of its 
capacity to generate value for passenger flow and cargo transport, effectively depends on the identification of 
the demand generation potential of each hub for this type of sendee. This requires the mapping of the passenger 
flow and volume of cargo transport of each region sen’ed by the system and the number of connections. The 
main goal of this study was to identify important factors that account for the great variability (demand) of 
regional hubsof the airport modal system in operation in the State of Sao Paulo, the most populated and 
industrialized in the Southeast region in Brazil. For this purpose, datasets for each airport related to passengers 
or cargo flow were obtained from time series data in the period ranging from January 01, 2008 to December 
31, 2014. Different data analysis approaches could imply in better mapping of the flow of the air modal system 
from the evaluation of some factors related to operations/volume. Therefore, different statistical models - such 
as multiple linear regression with normal errors and new stochastic volatility (SV) models - are introduced in 
this study, to provide a better view of the operation system in the four main regional hubs, within a large group 
of 32 airports reported in the dataset. 
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I. Introduction 

The initial studies in hub-and-spoke transport 
networks always related themes addressed to the 
transport of passengers and hub location problems, 
the so-called p-hub. In the 2000 and 2010 decades, 
new studies associating an extension of the main 
related issues, such as transport of air cargo, were 
introduced in the literature as a strategy to identify 
the regional hub profile, consequently leading to 
new research themes. 

In this direction, Alumur and Kara (2008) 
introduced the modeling of a network that operates 
with the hub concept, that is, a new class of models 
denoted as the network hub location models 
(NHLM). 

In another paper, Farahani et al. (2009) 
introduced a good review of methods and 
applications for a special class of models denoted as 
Hub Localization Problems (HLP). Later, due to the 
importance of these models, Campbell and O'Kelly 


(2012) provided an overview of the first motivations 
of the academy regarding specific studies with 
emphasis on the analysis of Hub Localization 
Problems (HLP) as the highlight for future research. 

In this context, the issue highlighted for 
research in this paper focuses on how to determine 
strategic hubs linking the flow of passengers to the 
volume of air cargo transported in a given region. In 
this case, the mapping of the key factors of regional 
hubs through the statistical analysis of airport 
operations and economic covariates compose the 
general goal of this paper. 

The specific goals associated to air cargo in 
Brazil and the world are the application of different 
statistical models to analyze the variables and 
covariates assuming a dataset linked to a group of 32 
airports in the State of Sao Paulo obtained from the 
statistical handling reports of the Sao Paulo Airway 
Department (DAESP-http://www.daesp. sp.gov.br/), 
in Brazil. DAESP is an office linked to the air 
transport in the State of Sao Paulo. In this study, we 
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consider the comparison between the development 
and the seasonality of passenger and air cargo and 
the determination of strategic factors that form 
regional hubs from the comparative findings 
between passenger and air cargo in terms of airport 
economic operations. From the 32 airports reported 
in the dataset, our study considers the four most 
important ones, which provides a good overview of 
the sector. 

The shortage of hub-and-spoke studies 
associated with air cargo transport justifies this 
research, and its main contribution is to analyze the 
composition of the formation of regional hubs 
through airport operations and economic factors. 
The use of statistical models such as stochastic 
volatility time series models and linear regression 
models brings us the relevant findings using the 
dataset from DAESP. 

1.1 A brief review of the literature 

The customization of p-hub designs for air 
cargo (Morrel and Pilon, 1999) and location 
distribution of air cargo systems due to transient 
hubs are described by Kara and Tansel (2001). Lin et 
al. (2003) study the indirect connections to large 
postal cargo hub networks in general and the 
multiple use of aircraft capabilities to consolidate 
center-to-center hubs and the development of flow 
models to generate improvements in the strategic 
mapping of hub networks for air cargo. 

Gardiner, Humphreys and Ison (2005), 
Gardiner and Ison (2008), Gardiner. Humphreys and 
Ison (2016) quantify the importance of an individual 
airport through the concept of "airport hierarchy" 
(primary hubs, secondary hubs, tertiary hubs). 
Alumur et al. (2007, 2009) emphasize the need to 
determine potential flows at airports through the 
criterion of the analysis of volumes of cargo 
operations in a multi-modal and hierarchical 
network. 

Several studies emphasize issues related to the 
capacity of an airport (see, for example, Smilowitz 
and Daganzo, 2007; Tan and Kara, 2007; Scholz and 
Cossel, 2011; Petersen, 2007; Bartodziej et al., 2009; 
Leung et al., 2009; Li et al., 2009; Wang and Kao, 
2008; Amaruchkul and Lorchirachoonhkul, 2011). 

Assaf and Gillen (2012) highlight, according to 
the literature on measuring and understanding 
factors affecting airport efficiency, that new studies 
have been introduced considerably over the last few 
years using different methodologies. The authors 
emphasize the use of complex analytical methods, 
such as Data Envelopment Analysis (DEA) and 
stochastic frontier (SF) distance functions. 

Wu, et al. (2011) reported the growth of hub- 
and-spoke networking, allowing for major airports to 
limit the size of passenger demand in the air capture 
to become the main hubs in their respective regions 
for cargo. Gardiner and Ison (2008) reported the 


existence of three classes of important decisions for 
the air cargo operator to choose the airport that will 
operate and the geographical location of the airport, 
financial return and airport security operations. 
Scholz and Cossel (2011) point out three important 
points that serve as an argument and reflection for 
this research: 

V The growth of air cargo tonnages changing the 
relationship between passengers and cargo has 
become a significant source of revenue for 
airlines and airports; 

V The importance of individual airports on a 
network usually assessed by measures based on 
passengers, cargo and operating numbers; 

V The combined passenger and cargo services 
operated in the major airlines. 

Heinicke (2006, 2007), Scholz and Cossel 
(2011) and Onghena (2011) point out that the 
literature is scarce on this subject and this type of 
research has been mostly focused on the integration 
between airlines and not on the integration of hub- 
and-spoke networks. Some important studies on the 
subject are also introduced by other authors (Grosso 
and Shepherd, 2011; Alumur et al., 2012; Bowen, 
2012; Oktal and Ozger, 2013; Lakew and Tok, 
2015). Feng et al. (2015) describe that cargo 
transport is more complex than carrying passengers 
because the former involves more actors, more 
sophisticated processes combining volume, varying 
priority services, integration strategies and 
consolidation and various itineraries in a 
transmission system (see also Huang and Lu, 2015). 

Alamo and Brinati (2006), Lin et al. (2003), 
Tan and Kara (2007), Sholz and Cossel (2011), 
Fraga (2011), Costa et al. (2011), Oliveira and 
Correia (2011), Grosso and Sheperd (2011), Alumur 
et al. (2012), Torquato and Junior (2012, 2014), 
Oktal and Ozger (2013), Lakew and Tok (2015), 
Feng et al. ( 2015), Huang and Lu (2015) used one 
or all of the three decisive points that influenced the 
method of this research: 

V The use of different statistical models in the 
analysis of the data; 

V The use of response variables and covariates; 

V The use of secondary data obtained from 
official offices in the airline industry and 
economy variables to their studies. 

The paper is organized as follows: in section 2, 
we present the goals of the research and the dataset; 
in section 3, we present the proposed methodology; 
in section 4, we present the results considering the 
different models; in section 5, we present a detailed 
discussion of the obtained results for each airport 
considered in the study; in section 6, we present the 
fit of the proposed stochastic volatility (SV) model; 
finally, in section 7, we present some concluding 
remarks. 
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II. Goals of the study and the times series data 

A study to determine strategic hubs associated 
with the flow of passengers and the volume of air 
cargo transported in a region is a starting point for 
research, which is still scarce in many regions of the 
world, and this approach could generate important 
strategic contributions to air transport networks and 
impacts on the local and regional economy of these 
airports. 

The statistical method used in this work seeks 
to analyze the temporal time series data obtained 
from statistical reports of DAESP for the period 
ranging from 2008 to 2014 extracted from a set of 
10,572 observations formed by 32 airports, two 
response variables (volume of passengers and cargo) 
associated with two economic covariates, the dollar 
exchange rate and the unemployment rate obtained 
from the economic Brazilian office of the 
Commercial Association of Sao Paulo (ACSP- 
http://www.acsp.com.br/ ). 

This way, we consider two statistical 
approaches for the data analysis: the use of multiple 
linear regression with normal errors for the 
logarithm transformed dataset and new stochastic 
volatility (SV) models considering the four main 
regional hubs, within the large group of 32 airports 
reported in the dataset. The four airports belong to 
the following locations: Ribeirao Preto, Sao Jose do 
Rio Preto, Bauru/Arealva and Presidente Prudente. 

III. Methodological framework 

In this section, we introduce the different 
modeling approaches to be used in the data analysis: 
multiple linear regression model with normal errors 
and stochastic volatility (SV) models. 

3.1 Use of a multiple linear regression models 

The use of regression models is important to 
report a response variable in conjunction with 
several factors that may be related to this response. 
The construction of general regression models is 
made empirically, and each model must be checked 
for fit from the residual analysis of the model. The 
use of regression models allows to statistically 
identify which of these factors (use of hypothesis 
testing) significantly affect the response. In addition, 
a regression model is also used to forecast future 
values of the response dataset of covariates. 

For the setting of the models, let N SI be a 
fixed integer number that records the amount of 
observed data (in our case, it will represent the 
counting of passengers or cargo for the airports of 
the State of Sao Paulo). Also, let Yj(t), t = 1, 2, . . . , 
N, j = 1, 2, . . . , K, indicating the times series in the 
logarithm scale recording for the counting of 
passengers or cargo in the t th month, t = 1, 2, . . . , N 


and j th airport, j = 1,2,3,4. Here N = 84 months and 
K=4 (j = 1 for Ribeirao Preto; j = 2 for Sao Jose do 
Rio Preto; j = 3 for Bauru/Arealva; j = 4 for 
Presidente Prudente). 

We consider the multiple linear regression 
model in each of the four main airports j = 1,2,3,4: 
Yj(t) = Pj 0 + Pjidollar exchange.rate + 
Pj 2 unemployment.rate+ P^months + P j4 years + e/t), 
(1) 

where the error term e, is assumed as a random 
variable with a normal distribution with mean equal 
to zero and constant variance cr. 

Linear regression models are often fitted using 
the least squares approach. When using more than 
one explanatory variable to predict the behavior of a 
variable response, the model is denoted in the 
literature as a multiple regression model (Draper and 
Smith, 1981). In multiple linear regression analysis, 
the overall effect of covariates on the Y response is 
verified (see, for example. Draper and Smith, 1981; 
Seber and Lee, 2003; Montgomery and Runger, 
2011 ). 

3.2 Use of a Stochastic volatility (SV) models under 
a Bayesian approach 

Stochastic volatility (SV) models have been 
extensively used to analyze financial time series (see 
Danielsson,1994; Yu, 2002) as a powerful 
alternative for existing auto-regressive models such 
as ARCH (autoregressive conditional 
heteroscedastic) introduced by Engle (1982) and the 
generalized autoregressive conditional 

heteroscedastic (GARCH) models introduced by 
Bollerslev (1986) but rarely used in transport or 
other engineering applications (see also Ghysels, 
1996; Kim & Shephard, 1998; or Meyer & Yu, 
2000 ). 

In the presence of heteroscedasticity, that is, 
variances depending on the t time, assume that the 
time series Yj(t), t = 1, 2, . . . , N; j = 1,2,3,4 can be 
written as 

Yj(t) = Pj 0 + Pjidollar exchange.rate+ 
Pj 2 unemployment.rate+ Pj 3 inonths + P |4 years + 

Oj(t)Cj(t) 

(2) 

where ej(t) is a noise considered to be independent 
and identically distributed with a normal distribution 
N(0, of) and Oj(t) is the square root of the variance 
of (2) (for simplicity, it is assumed that oj= 1, since 
in our case the obtained inference results do not have 
significative changes). The variance of Yj(t) is 
assumed to be given by the model a 2 e hj(t) where hj(t) 
depends on a latent variable or unobserved variable. 

It is interesting to observe that usually a 
stochastic volatility process Yj(t) in finance 
applications is given by a special case of the 
equation (2), that is, given by the model Yj(t) = 
Oj(t)ej(t) where Yj(t) is the logarithm of returns and 
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Oj(t) is a strictly stationary sequence of positive 
random variables which is independent of the 
independent identically distributed noise sequence 
Ej(t). 

The independence of the noise e/t) and the 
volatility o/t) allows for a much simpler 
probabilistic structure than that of a GARCH 
(Generalized Autoregressive Conditional 
Heteroscedasticity) process, which includes explicit 
feedback of the current volatility with previous 
volatilities and observations. 

This is one of the advantages of stochastic 
volatility (SV) models. In this case, the mutual 
hj(l) = Mj + Q(l)> t=l, 

h | (2) = p J +<^ lj [h J (l)-p l l + Q(2) 

hj(t) = Hj + < t>ij[hj(t-1) - p, J + 


independence of the sequences Yj(t) and Oj(t) and 
their strict stationarity immediately imply that Yj(t) 
is strictly stationary. Conditions for the existence of 
a stationary GARCH process are much more 
difficult to establish (see, for example. Nelson, 1990 
and Bougerol & Picard, 1992). 

To analyze the dataset, we assumed a latent 
variable (non-observed variable) defined by an 
autoregressive model AR(2) given, for t 
=1,2,3,...,N; j=l,2,...,K, assuming the following 
SV model: 


(3) 

<t>2j[hj(t-l) - p, | + Q(t), t = 3,4,...,N, 


where Q(l) is a noise with a Normal distribution N(0, 
), which is associated to the latent variable hj(t). 
The quantities o^ , p, , (j)!, and (j) 2 j , j =1,2,3,4 are 
unknown parameters that should be estimated ; also I 
<t»ij I < 1 and I <(> 2 j I <1. 

This way, Yj(t) has a normal distribution given 
by: 

Yj(t) ~ N(gj, a2 e hj(t) ), (4) 

where gj = Pj Q + Pjidollar exchange.rate+ 
Pj 2 unemployment.rate+ Pj 3 months + Pj 4 years; 

hj(l) ~ N(pj, of ;-); 

hj(2)l hi(l) ~ N(pj + <^(1) - p, ], of c ); 

(5) 


For a Bayesian analysis of the models, it is 
assumed that the prior distributions for the 
parameters Pj, <t> V j and a fc, > v =1,2; j =1,2,3,4 are, 
respectively, a Normal N(0,a?) distribution, a 
Beta(bj,Cj) distribution and a Gamma(dj,ej) 
distribution, where Beta(b,c) denotes a Beta 
distribution with mean b/(b+c) and variance 
bc/[(b+c) 2 (b+c+1)] and Gamma(d,e) denotes a 
Gamma distribution with mean d/e and variance 
d/e 2 . The hyperparameters aj, bj, Cj, dj and ej are 
considered to be known and are specified latter. 
These prior distributions were chosen by observing 
the ranging of values in each parameter. 

IV. Results assuming the different 

models 


hj(t)l hj(t-l) ~ N(pj + 4>ij[hj(t-l) - p, ] + 4> 2 j[hj(t-l) - p| 

],af s ), t = 3,4,..., N 

The likelihood function of the SV defined by 
(2) given hj(t) which depends on a latent variable or 
unobserved variable is given for each j = 1,2,3,4, by, 

L = n t N =i f[Yj(t) / hj(t)] (6) 

where, f [yj(t) / hj(t)] is the density function of a 
normal distribution N(gj, CT 2 e hj(t) ). 

Bayesian inference based on Markov Chain 
Monte Carlo (MCMC) methods (see, for example, 
Gelfand & Smith, 1990, or Smith & Roberts, 1993) 
has been considered to analyze stochastic volatility 
(SV) models. The main reason for the use of 
Bayesian methods is that we may have great 
difficulties when using a standard classical inference 
approach. 

Those difficulties may appear in the form of 
high dimensionality and likelihood function (note 
that, under a classical approach, we should eliminate 
the latent variables in hj(t) by integration) with no 
closed form, among other factors. 


4.1 Use of multiple linear regression models 

The study data refers to the monthly movement 
of passengers and cargo in the period from January 
01, 2008 to December 31, 2014 at 32 airports in 
the State of Sao Paulo (Andradina, Aracatuba, 
Araraquara, Assis, Avare/Arandu, Barretos, Bauru, 
Bauru/Arealva, Botucatu Braganca Paulista, 
Campinas (Amaral), Dracena, Franca, Itanhaem, 
Jundiaf, Lins, Marflia, Ourinhos, Penapolis, 
Piracicaba, Presidente Epitacio, Presidente Prudente, 
Registro, Ribeirao Preto, Sao Carlos, Sao Jose do 
Rio Preto, Sao Manuel, Sorocaba, Tupa, Ubatuba, 
Urubupunga, Votuporanga) based on statistical year 
reports reported by DAESP (2015). 

In addition to the counts (passenger/cargo), we 
also consider some covariates that may be correlated 
with the flow (passenger/cargo), such as the dollar 
exchange rate and the unemployment rate obtained 
from the economic Brazilian office of ACSP. For 
two airports (Urubupunga and Registro), we have a 
lot of missing data, therefore the data from these two 
airports aren’t considered in the statistical analysis. 

In Figure 1, we present the graphs of the time 
series for the four variables (passenger/cargo counts. 
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dollar exchange rate and unemployment rate to December 31, 2014 considering the total flow for 

reported in the period ranging from January 01, 2008 all airports. 

Figure 1: Time series passenger/cargo counts, dollar exchange rate and unemployment rate on original and 
logarithmic scale. 



From the graphs of Figure 1, we observe that: 

V Apparently, the number of passengers increases over time; then decreases a little; 

V Apparently, the amount of cargo increases over time; then decreases a little; 

V Apparently, the monthly number of passengers decreases with the increased dollar exchange rate. 

To study the relationship between the variables and to find the most important factors affecting the 
variability of passenger/cargo count in the period from January 1, 2008 to December 31, 2014, we considered a 
first statistical analysis assuming multiple regression models relating the covariates dollar exchange rate, 
unemployment rate, years and months with the responses given by passengers and cargo counts. 

4.1.1 Use of a multiple regression model for analysis of data considering the total of all airports 


To satisfy some necessary assumptions of the regression model (normality of errors and constant variance) 
a multiple linear regression model with the response given in the logarithmic scale (1) was considered in this 
study. 

Assuming the responses (passengers/month) transformed to a logarithmic scale and the covariates dollar 
exchange rate, unemployment rate, month and year in a multiple regression model fitted by the least squares 
method (obtained with the Minitab® software), we have: 

log(passengers) = 12.5 — 0.341 dollarexchangerate — 0.0659 unemploymentrate + 0.0084 month 
+ 0.138 year 


(7) 

In Table 1, we present the least squares estimates (LSE) for the regression coefficients, the standard errors 
(SE) of the estimators, the T-Student statistics values and the p-values to test if each regression parameter is 
equal to zero. 


Table I: LSE estimators for the regression parameters (all airports) 


Passengers (s=0.11549; R=89.5%) 


predictor 

LSE 

SE of coef 

T 

P 

constant 

12.5313 

0.2646 

47.36 

< 0.001 

dollar exchange rate 

-0.34118 

0.05114 

-6.67 

< 0.001 

unemployment rate 

-0.06589 

0.03064 

-2.15 

0.035 

months 

0.008469 

0.005254 

1.61 

0.111 

years 

0.13831 

0.02003 

6.90 

< 0.001 

Cargo (s=0.15216; R J = 

54.8%) 




predictor 

LSE 

SE of coef 

T 

P 

constant 

13.9794 

0.3486 

40.10 

< 0.001 

dollar exchange rate 

-0.40585 

0.06738 

-6.02 

< 0.001 

unemployment rate 

-0.08800 

0.04037 

-2.18 

0.032 
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months 0.003634 0.006922 0.52 0.601 

years_0.02048_0.02639 0.78 0.440 


The necessary assumptions for the model 
(normal residuals and constant variance) were 
verified from residual plots. This was also done for 
all multiple linear regression models considered in 
this study. From the results in Table 1, we have: 

Y Years, dollar exchange rate and 
unemployment rate affect the number of 
passengers (p-value < 0.05), that is, regression 
coefficients are statistically different from 
zero at a significance level of 5%. 

V We observe a positive value for the regression 
parameter related to years (0.13831), which 
implies that there is a significant increase in 
the number of passengers over the years 
(2008-2014). 

V We observe a negative value for the regression 

parameter related to dollar exchange rate (- 
0.34118), which implies that there is a 

significant decrease in the number of 
passengers with increased dollar exchange rate 
(January 01, 2008 to December 31, 2014). 

Y We observe a negative value for the regression 

parameter related to unemployment rate (- 
0.06589), which implies that there is a 

significant decrease in the number of 
passengers with increased unemployment 
(January 01, 2008 to December 31, 2014). 

Y Approximately 89.5% of the data variability is 
explained by the model, that is, an excellent 
fit. 

V An estimator for the standard deviation of 
error is given by 0.1155. 

Assuming the responses (cargo/month) 
transformed to a logarithmic scale and the 
covariates dollar exchange rate, unemployment 
rate, month and year in a multiple regression model 
fitted by the least squares method (obtained with 
the Minitab® software), we have: 
log(cargo) = 14.0 — 0.406 dollarexchangerate 
— 0.0880 unemploymentrate 
+ 0.00363 month + 0.0205year 

(8) 


In Table 1, we also have the least squares 
estimates for the regression coefficients, the 
standard errors (SE) of the estimators for the cargo 
flow, the T-Student statistics values and the p- 
values to test if each regression parameter is equal 
to zero. From the results of Table 1, we have, 

Y The unemployment rate and dollar exchange 
rate affect the transport of cargo (p-value < 
0.05), that is, the regression coefficients are 
statistically different from zero. The months 
and years do not affect the transport of cargo 
(p-value > 0.05). 

Y We observe a negative value for the regression 
parameter related to dollar exchange rate (- 
0.40585), that is, there is a significant decrease 
in cargo transport with increased dollar 
exchange rate (January 01, 2008 to December 
31,2014). 

Y We observe a negative value for the regression 
parameter related to unemployment rate (- 
0.08800), that is, a significant decrease in 
cargo volume with increased unemployment 
(January 01, 2008 to December 31, 2014). 

Y Approximately 55% of the data variability is 
explained by the model. 

Y An estimate for the standard deviation of the 
error is given by 0.1521. 

Using the fitted models (7) and (8), we can 
make predictions for future observations. For 
example, if we consider month = 1 (January), year 
= 8 (2015), dollar exchange rate = 3.30 and 
unemployment rate = 9.50, the expected value is 
given in the original scale by 146,444 for 
passengers and 158,198 for cargo. In Figure 2, we 
have the graphs of the time series for passenger and 
cargo counts reported monthly in the period 
(January 01, 2008 to December 31, 2014) 
considering the total for all airports and the fitted 
values obtained by the regression models (7) and 
(8). A good fit of the model to the data is observed. 
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Figure 2: Time series for observed and fitted values for total passenger and cargo. 




4.1.2. Use of a multiple regression model for the data analysis considering the four airports 

In Figure 3, we have the graphs of the time series for passenger and cargo counts in the airports of Ribeirao 
Preto, Sao Jose do Rio Preto, Bauru/Arealva and Presidente Prudente, the most important airports of the region, 
reported monthly in the period (January 01, 2008 to December 31, 2014). 

Figure 3:Time series for passenger/cargo counts for the airports of Ribeirao Preto, Sao Jose do Rio Preto, 
Bauru/Arealva and Presidente Prudente. 



From Figure 3, we have: 

■S Apparently, the number of passengers increases over time for all airports; then decreases a little. 

■S Apparently, the amount of cargo increases over time to the airports of Ribeirao Preto, Bauru, and Sao Jose 

do Rio Preto, then decreases. The behavior for Presidente Prudente airport is different from the other 
airports. 

4.1.2.1 Response log (passengers/month) and log (cargo/month) - Ribeirao Preto airport 

Assuming the responses (passengers/month) transformed to a logarithmic scale and the covariates dollar 
exchange rate, unemployment rate, month and year in a multiple regression model fitted by the least squares 
method (obtained with the Minitab® software), we have: 

log(passRP) = 11.7 — 0.427 dollarexchangerate — 0.0812 unemploymentrate + 0.0140 month + 

0.158 year (9) 

In Table II, we have the least squares estimates for the regression coefficients, the standard errors (SE) of 
the estimators, the T-Student statistics values and the p-values to test if each regression parameter is equal to 
zero. 

Table II: LSE for the regression parameters (Ribeirao Preto) 

Passengers (s=0.15594; R 2 =86.5%) 
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predictor 

LSE 

SE of coef 

T 

P 

constant 

11.7171 

0.3573 

32.80 

< 0.001 

dollar exchange rate 

-0.42681 

0.06905 

-6.18 

< 0.001 

unemployment rate 

-0.08115 

0.04137 

-1.96 

0.053 

months 

0.014048 

0.007094 

1.98 

0.051 

years 

0.15831 

0.02705 

5.85 

< 0.001 

ai-go (s=0.32759; R~=22.6%) 

predictor 

LSE 

SE of coef 

T 

P 

constant 

11.8536 

0.7505 

15.79 

< 0.001 

dollar exchange rate 

-0.1714 

0.1451 

-1.18 

0.241 

unemployment rate 

-0.11469 

0.08691 

-1.32 

0.191 

months 

0.01812 

0.01490 

1.22 

0.228 

years 

0.00031 

0.05682 

0.01 

0.996 


Assuming the response (cargo/month) transformed to a logarithmic scale and the covariates dollar 
exchange rate, unemployment rate, month and year (obtained with the Minitab® software), we have: 
log(cargoRP) = 11.9 — 0.171 dollarexchangerate — 0.115 unemploymentrate + 0.0181months + 

0.0003 years (10) 

In Table 2, we have the least squares estimates for the regression coefficients, the standard errors (SE) of 
the estimators, the T-Student statistics values and the p-values to test if each regression parameter (cargo) is 
equal to zero. 

4.1.2.2 Response log (passengers/month) and log (cargo/month) - Sao Jose do Rio Preto airport 

Assuming the responses (passengers/month) transformed to a logarithmic scale and the covariates dollar 
exchange rate, unemployment rate, month and year in a multiple regression model fitted by the least squares 
method (obtained with the Minitab® software), we have: 

log(passSJRP) = 11.4 — 0.363 dollarexchangerate — 0.100 unemploymentrate + 0.00297month + 

0.147 year (11) 

In Table III, we have the least squares estimates for the regression coefficients, the standard errors (SE) of 
the estimators, the T-Student statistics values and the p-values to test if each regression parameter is equal to 
zero. 

Table III: LSE estimates for the regression parameters (Sao Jose do Rio Preto) 

Passengers (s=0.14981; R"=87.4%) 


predictor 

LSE 

SE of coef 

T 

P 

constant 

11.4273 

0.3432 

33.29 

< 0.001 

dollar exchange rate 

-0.36318 

0.06634 

-5.47 

< 0.001 

unemployment rate 

-0.10045 

0.03975 

-2.53 

0.013 

months 

0.00297 

0.00682 

0.44 

0.664 

years 

0.14711 

0.02599 

5.66 

< 0.001 

Cargo (s=0.18647; R-= 

=65.3%) 




predictor 

LSE 

SE of coef 

T 

P 

constant 

11.5128 

0.4272 

26.95 

< 0.001 

dollar exchange rate 

-0.43256 

0.08257 

-5.24 

< 0.001 

unemployment rate 

0.04099 

0.04947 

0.83 

0.410 

months 

0.000013 

0.008483 

0.00 

0.999 

years 

-0.05743 

0.03234 

-1.78 

0.080 


Assuming the responses (cargo/month) transformed to a logarithmic scale and the covariates dollar 
exchange rate, unemployment rate, month and year in a multiple regression model fitted by the least squares 
method (obtained with the Minitab® software), we have: 

log(cargoSJRP) = 11.5 — 0.433 dollarexchangerate + 0.0410 unemploymentrate + 
O.OOOOlmonths—0.574 years (12) 

4.1.2.3 Response log (passengers/month) and log (cargo/month) - Bauru/Arealva airport 
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Assuming the responses (passengers/month) transformed to a logarithmic scale and the covariates dollar 
exchange rate, unemployment rate, month and year in a multiple regression model fitted by the least squares 
method (obtained with the Minitab® software), we have: 

log(passBauru) = 9.45 — 0.397dollarexchangerate— 0.105 unemploymentrate + 0.0128 months + 

0.217 years (13) 

In Table VI, we have the least squares estimates for the regression coefficients, the standard errors (SE) of the 
estimators, the T-Student statistics values and the p-values to test if each regression parameter is equal to zero. 

Table IV: LSE estimators for the regression parameters (Bauru/Arealva) 


Passengers (s=0.25129; R =82.1%) 


predictor 

LSE 

SE of coef 

T 

P 

constant 

9.4487 

0.5757 

16.41 

< 0.001 

dollar exchange rate 

-0.3965 

0.1113 

-3.56 

0.001 

unemployment rate 

-0.10493 

0.06667 

-1.57 

0.120 

months 

0.01276 

0.01143 

1.12 

0.268 

years 

0.21712 

0.04359 

4.98 

< 0.001 

irgo (s=0.23620; R-=76.7%) 

predictor 

LSE 

SE of coef 

T 

P 

constant 

12.6354 

0.5411 

23.35 

< 0.001 

dollar exchange rate 

-0.3451 

0.1046 

-3.30 

0.001 

unemployment rate 

-0.16404 

0.06267 

-2.62 

0.011 

months 

0.00339 

0.01075 

0.32 

0.753 

years 

0.12271 

0.04097 

3.00 

0.004 


Assuming the responses (cargo/month) transformed to a logarithmic scale and the covariates dollar 
exchange rate, unemployment rate, month and year in a multiple regression model fitted by the least squares 
method (obtained with the Minitab® software), we have: 

log(cargoBauru) = 12.6 — 0.345 dollarexhangerate — 0.164 unemploymentrate + 0.0034 months + 

0.123 years (14) 

4.1.2.4 Response log(passengers) and log(cargo) - Presidente Prudente airport 

Assuming the responses (passengers/month) transformed to a logarithmic scale and the covariates dollar 
exchange rate, unemployment rate, month and year in a multiple regression model fitted by the least squares 
method (obtained with the Minitab® software), we have: 

log(passPP) = 9.08 — 0.279 dollarexchangerate + 0.056 unemploymentrate + 0.0339 months + 

0.180 years (15) 

In Table V, we have the least squares estimates for the regression coefficients, the standard errors (SE) of 
the estimators, the T-Student statistics values and the p-values to test if each regression parameter is equal to 
zero. 

Table V. LSE estimates for the regression parameters (Presidente Prudente) 


Passengers (s=0.14254; R =80.7%) 

predictor 

LSE 

SE of coef 

T 

P 


constant 

9.0773 

0.3266 

27.80 

< 0.001 


dollar exchange rate 

-0.27866 

0.06312 

-4.41 

< 0.001 


unemployment rate 

0.05634 

0.03782 

1.49 

0.140 


months 

0.033933 

0.006485 

5.23 

< 0.001 


years 

0.18048 

0.02472 

7.30 

< 0.001 


Cargo (s=0.34393; R‘=29.8%) 

predictor 

LSE 

SE of coef 

T 

p 


constant 

8.0213 

0.7880 

10.18 

< 0.001 


dollar exchange rate 

-0.6919 

0.1523 

-4.54 

< 0.001 


unemployment rate 

0.21114 

0.09125 

2.31 

0.023 


months 

0.03557 

0.01565 

2.27 

0.026 


www.ijmret.org 

ISSN : 

2456-5628 



Page 43 

















International Journal of Modern Research in Engineering and Technology (IJMRET) 
www.ijinret.org Volume 3 Issue 7 II July 2018. 

-_ __I 


years_0.23222_0.05966 3.89 < 0.001 


Assuming the response (cargo/month) transformed to a logarithmic scale and the covariates dollar 
exchange rate, unemployment rate, month and year in a multiple regression model fitted by the least squares 
method (obtained with the Minitab® software), we have: 

log(cargoPP) = 8.02 — 0.692 dollarrate + 0.211 unemploymentrate + 0.0356 months + 0.232 years 
(16) 

4.2. Statistical analysis of the data of the airports in the State of Sao Paulo using stochastic volatility (SV) 
models under a Bayesian approach 

Let us assume Beta (1,1) prior distributions for (j) |j, uniform U(0,0.1) prior distributions for <t» 2 j , 
Gamma(l.l) prior distributions for , normal N(0,1) prior distributions for p, ,normal N(10,l) prior 
distributions for Pj 0 and normal N(0,1) prior distributions for Pji, j =1,2,3,4; 1 =1,2,3,4. 

A burn-in sample with 21,000 samples is considered to eliminate the effect of the initial values in the 
iterative method; after that, another 90,000 samples are generated taking samples from 10 to 10 totaling a final 
sample size of 9,000 to get the posterior summaries of interest (see Table 6). 

In the simulation of samples of the joint posterior distribution of interest the OpenBugs software is used 
(Spiegelhalter et al, 2003). The convergence of the Gibbs sampling algorithm was verified from standard 
traceplots of the generated Gibbs samples. 

Considering the cargo volume in the four airports, the same prior distributions assumed for the passenger 
case are assumed, that is. Beta (1.1) prior distributions for tpjj, uniform U (0.1) prior distributions for <t> 2 j. Gamma 
(1.1) prior distributions for abnormal N (0.1) prior distributions for p, , normal N (10.1) prior distributions for 
Pj 0 andN (0.1) prior distributions for Pji, j = 1,2,3,4; 1 = 1,2,3,4. 

Also using the OpenBugs software, a burn-in sample size of 21,000 is considered; then, we generated 
another 90,000 samples taking samples from 10 to 10 totaling a final sample size of 9,000 to get the posterior 
summaries of interest (see Table VII). 


Table VI: Posterior s ummaries - passengers 



mean 

sd 

LL2.5% 

UL97.5% 

Pio 

11.07 

0.3404 

10.46 

11.81 

Pll 

-0.3385 

0.1178 

-0.621 

-0.1739 

Pl2 

-0.03404 

0.03757 

-0.1123 

0.04636 

P 13 

0.02243 

0.007268 

0.007812 

0.03657 

P 14 

0.1752 

0.02824 

0.1241 

0.2374 

P20 

11.06 

0.4434 

10.23 

11.91 

P21 

-0.2935 

0.09021 

-0.5058 

-0.1609 

P22 

-0.08124 

0.03879 

-0.1575 

-0.007583 

P 23 

0.007795 

0.007734 

-0.007614 

0.02227 

P 24 

0.1667 

0.02578 

0.1146 

0.2149 

P30 

8.675 

0.3949 

7.922 

9.469 

P31 

-0.3248 

0.1028 

-0.5348 

-0.1395 

P32 

-0.04081 

0.04516 

-0.1344 

0.04643 

P33 

0.02598 

0.007805 

0.009535 

0.04048 

P34 

0.2378 

0.02762 

0.1825 

0.2922 

P40 

9.234 

0.2451 

8.707 

9.697 

P41 

-0.22 

0.04789 

-0.3116 

-0.124 

P42 

0.04343 

0.0281 

-0.006466 

0.1049 

P43 

0.02887 

0.003767 

0.022 

0.0369 

P44 

0.1445 

0.01909 

0.1093 

0.1855 

bl 

-2.629 

0.9792 

-4.1 

-0.4999 

b2 

-2.749 

1.018 

-4.044 

-0.5457 

b3 

-2.416 

0.7627 

-3.637 

-0.6911 

b4 

-0.9206 

0.8072 

-2.464 

0.6866 

4>n 

0.8498 

0.1268 

0.5029 

0.9881 

4*12 

0.7309 

0.2486 

0.1116 

0.979 

^13 

0.8444 

0.08187 

0.66 

0.9713 

4*14 

0.9536 

0.02412 

0.9035 

0.9945 

4*21 

0.05135 

0.02852 

0.002768 

0.0977 

4*22 

0.0503 

0.02851 

0.002886 

0.09743 
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4*23 

0.04909 

0.02867 

0.002549 

0.097 

4*24 

0.0658 

0.0244 

0.01118 

0.09863 

Cl 

2.268 

1.005 

0.8512 

4.77 

C2 

2.924 

1.331 

1.094 

6.248 

C3 

1.909 

0.843 

0.7078 

3.997 

C4 

2.78 

1.148 

1.111 

5.537 


(sd: standard deviation; LL2.5%: lower limit; UL97.5%: upper limit) 

Table VII: Posterior summaries - cargo 



mean 

sd 

LL2.5% 

UL97.5% 

Pio 

10.76 

0.5436 

9.8 

11.81 

Pn 

-0.1753 

0.06556 

-0.3101 

-0.04929 

Pl2 

-0.01155 

0.05656 

-0.1158 

0.09262 

P 13 

0.03134 

0.01098 

0.008729 

0.05112 

Pl4 

0.09188 

0.03801 

0.01714 

0.1584 

P20 

11.54 

0.3735 

10.83 

12.33 

P21 

-0.378 

0.1028 

-0.5565 

-0.1454 

P22 

0.02489 

0.04428 

-0.0643 

0.1138 

P23 

-0.001191 

0.008008 

-0.01742 

0.01418 

P24 

-0.06743 

0.03149 

-0.1328 

-0.005542 

P30 

11.86 

0.3814 

11.06 

12.58 

P31 

-0.3563 

0.09295 

-0.5425 

-0.1762 

P32 

-0.08597 

0.04153 

-0.1671 

9,80E-01 

P33 

0.02059 

0.006896 

0.0064 

0.03353 

P34 

0.155 

0.02563 

0.106 

0.2081 

P40 

8.312 

0.6196 

7.038 

9.519 

P41 

-0.5819 

0.1427 

-0.8501 

-0.2911 

P42 

0.1567 

0.06345 

0.03276 

0.2824 

P43 

0.0363 

0.0102 

0.01685 

0.05694 

P44 

0.1883 

0.04512 

0.1027 

0.2724 

Pi 

-1.871 

1.084 

-3.403 

0.4202 

P2 

-3.147 

0.6257 

-3.828 

-1.247 

P3 

-2.339 

0.8223 

-3.67 

-0.4718 

fU 

-0.6796 

0.8618 

-2.327 

1.011 

<1*11 

0.7816 

0.1544 

0.4134 

0.9768 

<1*12 

0.5057 

0.2594 

0.0409 

0.9401 

<1*13 

0.847 

0.08634 

0.6412 

0.9786 

<1*14 

0.8933 

0.05804 

0.7487 

0.9788 

<1*21 

0.04734 

0.0285 

0.002384 

0.09696 

<1*22 

0.05025 

0.02852 

0.002646 

0.09717 

<1*23 

0.05043 

0.02857 

0.002579 

0.09753 

<1*24 

0.06185 

0.02479 

0.009566 

0.0983 

Cl 

1.141 

0.5293 

0.4524 

2.451 

C2 

2.306 

1.117 

0.8192 

5.143 

C3 

1.801 

0.7774 

0.7008 

3.687 

C4 

2.811 

1.261 

1.054 

5.9 


(sd: standard deviation; LL2.5%: lower limit; UL97.5%: upper limit) 


5. Discussion of the obtained results for each airport using the two proposed models 

5.1 Ribeirao Preto airport 

5.1.1 Use of a multiple linear regression model 


From the results of Table 2, we observe that: 

V Years, months, dollar exchange rate and unemployment rate affect the number of passengers in Ribeirao 
Preto airport (p-value < 0.05 or very close to 0.05), that is, the regression coefficients are statistically 
different from zero at a significance level of 5%. 

V There is a positive value for the regression parameter related to years (0.15831), which implies that there is 


www.ijmret.org 


ISSN: 2456-5628 


Page 45 










International Journal of Modern Research in Engineering and Technology (IJMRET) 
www.ijmret.org Volume 3 Issue 7 II July 2018. 

I J 

a significant increase in the number of passengers in Ribeirao Preto airport over the years (2008-2014). 

■S There is a negative value for the regression parameter related to dollar exchange rate (-0.42691), which 
implies that there is a significant decrease in the number of passengers in Ribeirao Preto airport with 
increased dollar exchange rate (January 01, 2008 to December 31, 2014). 

■S There is a negative value for the regression parameter related to unemployment rate (-0.08115), which 
implies that there is a significant decrease in the number of passengers in Ribeirao Preto airport with 
increased unemployment (January 01, 2008 to December 31, 2014). 

V There is a positive value for the regression parameter related to months (0.014048), which implies that 
there is a significant increase in the number of passengers over the months. The end of each annual period 
leads to a significant increase in the number of passengers. 

V Approximately 86.5% of the variability of the data (number of passengers) is explained by the model. That 
is, we have an excellent fit. 

V An estimator for the standard deviation of error is given by 0.1559. 

From the results of Table 2, we have: 

V No covariate (month, year, unemployment rate and dollar exchange rate) affects cargo transport in Ribeirao 
Preto airport (p-value > 0.05), that is, the regression coefficients are not statistically different from zero. 
There is stability in cargo transport in Ribeirao Preto airport regardless of the variability of the covariates 
(month, year, unemployment rate and dollar exchange rate). This fact can also be seen in Figure 3. 

■S Approximately 22.6% of the data variability is explained by the model. 

■S An estimator for the standard deviation of the error is given by 0.3276. 

Using the fitted models (9) and (10) and considering month = 1 (January), year = 8 (2015), dollar exchange 
rate = 3.30 and unemployment rate = 9.50, the forecast value is given in the original scale by 49,901 for 
passengers and 27,424 for cargo volume in Ribeirao Preto airport. 

In Figure 4, we present the graphs of the time series for passenger and cargo counts reported in the period 
(January 01, 2008 to December 31, 2014) for Ribeirao Preto airport and the fitted values given by the regression 
models (9) and (10). A good fit of the models to the data is observed. 

Figure 4: Time series for observed values and fitted values for passengers and cargo-Ribeirao Preto. 




5.1.2 Use of a stochastic volatility (SV) model 

From the results in Table 6, similar results 
as obtained using a multiple linear regression 
model are obtained under a Bayesian approach for 
the passengers case, assuming a stochastic 
volatility (SV) model: 

V Years, months and dollar exchange rate affect 
the number of passengers in Ribeirao Preto 
airport (the 95% credible interval does not 
contain zero), that is, the regression 
coefficients are statistically different from 
zero. Observe that a 95% credible interval 
corresponds to a 95% confidence interval 
under a classical inference approach, that is, a 


5% significance level to test if each regression 
parameter is equal to zero against an 
alternative to be different from zero. Under a 
Bayesian approach, we get the inferences 
using the credible intervals. 

V A positive value is observed for the estimator 
of the regression parameter related to years 
(0.1752), which implies that there is a 
significant increase in the logarithm of the 
number of passengers in Ribeirao Preto airport 
over the years (2008/2014). 

V A negative value is observed for the estimator 
of the regression parameter related to the 
dollar exchange rate (-0.3385), which implies 
that there is a significant decrease in the 


www.ijmret.org 


ISSN: 2456-5628 


Page 46 






























International Journal of Modern Research in Engineering and Technology (IJMRET) 
www.ijmret.org Volume 3 Issue 7 II July 2018. 

I_ __ ) 


logarithm of the number of passengers in 
Ribeirao Preto airport with increased dollar 
exchange rate (January 01, 2008 to December 
31,2014). 

Y A positive value is observed for the estimator 
of the regression parameter related to months 
(0.02243), which implies that there is a 
significant increase in the logarithm of the 
number of passengers over the months. End of 
year leads to a significant increase in the 
number of passengers. 

From the results in Table 7, some important 

interpretations are obtained for the cargo case: 

Y Years, months and dollar exchange rate affect 
the cargo transport in Ribeirao Preto airport 
(the 95% credible interval does not contain 
zero), that is, the regression coefficients are 
statistically different from zero. 

Y A negative value is observed for the estimator 
of the regression parameter related to the 
dollar exchange rate (-0.1753), which implies 
that there is a significant decrease in the 
logarithm of cargo transport in Ribeirao Preto 
airport with increased dollar exchange rate. 

Y A positive value is observed for the estimator 

of the regression parameter related to months 
(0.03134), which implies that there is a 

significant increase in the logarithm of cargo 
transport in Ribeirao Preto airport (end of year 
leads to increased cargo volume). 

Y A positive value is observed for the estimator 

of the regression parameter related to years 
(0.09188), which implies that there is a 

significant increase in the logarithm of cargo 
transport in Ribeirao Preto airport over the 
years. 

5.2 Sao Jose do Rio Preto airport 

5.2.1 Use of a multiple linear regression model 

From the results of Table 3, we have: 

Y Years, dollar exchange rate and 

unemployment rate affect the number of 
passengers in Sao Jose do Rio Preto airport (p- 
value < 0.05), that is, the regression 
coefficients are statistically different from 

zero at a significance level of 5%. 

Y We observe a positive estimate for the 

regression parameter related to years 

(0.14711), which implies that there is a 
significant increase in the number of 

passengers in Sao Jose do Rio Preto over the 
years (2008/2014). 

Y We observe a negative value for the regression 

parameter related to dollar exchange rate (- 
0.36318), which implies that there is a 
significant decrease in the number of 

passengers in Sao Jose do Rio Preto airport 


with increased dollar exchange rate (January 
01, 2008 to December 31, 2014). 

We observe a negative value for the regression 
parameter related to unemployment rate (- 
0.10045), which implies that there is a 
significant decrease in the number of 
passengers in Sao Jose do Rio Preto airport 
with increased unemployment (January 01, 
2008 to December 31, 2014). 

Y Approximately 87.4% of the data variability is 
explained by the model, which is an excellent 
fit. 

Y An estimator for the standard deviation of the 
error is given by 0.1498. 

In Table 3, we have the least squares estimates 
for the regression coefficients, the standard errors 
(SE) of the estimators, the T-Student statistics 
values and the p-values to test if each regression 
parameter is equal to zero. From the results of 
Table 3, we have: 

Y The dollar exchange rate affects the transport 
of cargo (p-value < 0.05), that is, the 
regression coefficients are statistically 
different from zero at a significance level of 
5%. For years we have a significance effect 
considering a significance level of 10% (p- 
value <0.10). 

Y We observe a negative estimate for the 

regression parameter related to years (- 
0.05743), which implies that there is a 

significant decrease in cargo transport in Sao 
Jose do Rio Preto airport over the years (2008- 
2014). 

Y We observe a negative value for the regression 

parameter related to dollar exchange rate (- 
0.43256), which implies that there is a 

significant decrease in cargo transport in Sao 
Jose do Rio Preto airport with increased dollar 
exchange rate (January 01, 2008 to December 
31,2014). 

Y Approximately 65.3% of the data variability is 
explained by the model. 

Y An estimator for the standard deviation of the 
error is given by 0.1865. 

Using the fitted models (11) and (12) and 
considering month = 1 (January), year = 8 (2015), 
dollar exchange rate = 3.30 and unemployment 
rate = 9.50, the forecast value is given in the 
original scale by 34.697 for passengers and 22.366 
for the cargo volume for Sao Jose do Rio Preto 
airport. In Figure 5, we have the graphs of the time 
series for monthly passenger and cargo counts 
reported in the period (January 01, 2008 to 
December 31, 2014) at Sao Jose do Rio Preto 
airport and the fitted values obtained by the 
regression models (11) and (12). We observe a 
reasonable fit of the models for the data. 




www.ijmret.org 


ISSN: 2456-5628 


Page 47 






International Journal of Modern Research in Engineering and Technology (IJMRET) 
www.ijmret.org Volume 3 Issue 7 II July 2018. 

I_ __ 


Figure 5:Time series for the observed and fitted values - passenger and cargo, Sao Jose do Rio Preto. 




5.2.2 Use of a stochastic volatility (SV) model 

From the results in Table 6, some 
interpretations under a Bayesian approach for the 
passengers case, assuming a stochastic volatility 
model are given as follows: 

V Years, dollar exchange rate and 
unemployment rate affect the number of 
passengers in Sao Jose do Rio Preto airport 
(the 95% credible interval does not contain 
zero), that is, the regression coefficients are 
statistically different from zero. 

■S A positive value is observed for the estimator 
of the regression parameter related to years 
(0.1667), which implies that there is a 
significant increase in the logarithm of the 
number of passengers in Sao Jose do Rio Preto 
airport over the years (2008-2014). 

■S A negative value is observed for the estimator 
of the regression parameter related to the 
dollar exchange rate (-0.2935), which implies 
that there is a significant decrease in the 
logarithm of the number of passengers in Sao 
Jose do Rio Preto airport with increased dollar 
exchange rate (January 01, 2008 to December 
31,2014). 

V A negative value is observed for the estimator 

of the regression parameter related to the 
unemployment rate (-0.08124), which implies 
that there is a significant decrease in the 
logarithm of the number of passengers in Sao 
Jose do Rio Preto airport with increased 
unemployment (January 01, 2008 to 

December 31, 2014). 

From the results in Table 7, some 
important interpretations are obtained for the cargo 
case: 

■S Years and dollar exchange rate affect cargo 
transport in Sao Jose do Rio Preto airport (the 
95% credible interval does not contain zero), 
that is, the regression coefficients are 
statistically different from zero. The other 
factors are not significant, possibly due to the 


presence of some outlier that could affect the 
obtained inference. 

•S A negative value is observed for the estimator 
of the regression parameter related to the 
dollar exchange rate (-0.3780), which implies 
that there is a significant decrease in the 
logarithm of cargo transport in Sao Jose do 
Rio Preto airport with increased dollar 
exchange rate. 

•S A negative value is observed for the estimator 
of the regression parameter related to years (- 
0.06743), which implies that there is a 
significant decrease in the logarithm of cargo 
transport in Sao Jose do Rio Preto airport over 
the years. 

5.3 Bauru/Arealva airport 

5.3.1 Use of a multiple linear regression model 

From the results of Table 4, we have: 

•S Years and dollar exchange rate affect the 
number of passengers (p-value < 0.05), that is, 
the regression coefficients are statistically 
different from zero at a significance level of 
5%. 

•S We observe a positive estimate for the 

regression parameter related to years 
(0.21712), which implies that there is a 
significant increase in the number of 
passengers in Bauru airport over the years 
(2008-2014). 

•S We observe a negative estimate for the 

regression parameter related to dollar 
exchange rate (-0.3965), which implies that 
there is a significant decrease in the number of 
passengers in Bauru airport with increased 
dollar exchange rate (January 01, 2008 to 
December 31, 2014). 

•S Approximately 82.1 % of the data variability is 
explained by the model. That is, we have an 
excellent fit. 

•S An estimator for the standard deviation of 
error is given by 0.2513. 
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In Table 4, we have the least squares estimates 
for the regression coefficients, the standard errors 
(SE) of the estimators, the T-Student statistics 
values and the p-values to test if each regression 
parameter is equal to zero. From the results of 
Table 4, we have: 

V Dollar exchange rate and unemployment rate 
affect cargo transport (p-value < 0.05), that is, 
the regression coefficients are statistically 
different from zero at a significance level of 
5%. 

V We have a positive estimate for the regression 
parameter relatad to years (0.12271), which 
implies that there is a significant increase in 
cargo transport in Bauru airport over the years 
(2008/2014). 

V We have a negative estimate for the regression 
parameter related to dollar exchange rate (- 
0.3451), which implies that there is a 
significant decrease in cargo transport in 
Bauru airport with increased dollar exchange 
rate (January 01, 2008 to December 31, 2014). 

V We have a negative value for the regression 
parameter related to unemployment rate (- 


0.16404), which implies that there is a 
significant decrease in cargo transport in 
Bauru airport with increased unemployment 
(January 01, 2008 to December 31, 2014). 

•S Approximately 76.7% of the data variability is 
explained by the model. 

•S An estimator for the standard deviation of the 
error is given by 0.2362. 

Using the fitted models (13) and (14) and 
considering month = 1 (January), year = 8 (2015), 
dollar exchange rate = 3.30 and unemployment rate 
= 9.50, the forecast value is given in the original 
scale by 7.282 for passengers and 55.453 for cargo 
volume in the Bauru/Arealva airport. 

In Figure 6, we have the graphs of the time 
series for passenger and cargo counts reported in 
the period (January 01, 2008 to December 31, 
2014) for the Bauru/Arealva airport and the fitted 
values obtained from the regression models (13) 
and (14). A good fit of the models for the data is 
observed. 


Figure 6: Time series for observed values and fitted values-passengers and cargo, Bauru/Arealva. 



5.3.2 Use of a stochastic volatility (SV) model 

From the results in Table 6, some 
interpretations under a Bayesian approach for the 
passengers case, assuming a stochastic volatility 
model are given as follows: 

■S Years, months and dollar exchange rate affect 
the number of passengers in Bauru airport (the 
95% credible interval does not contain zero), 
that is, the regression coefficients are 
statistically different from zero. 

■S A positive value is observed for the estimator 
of the regression parameter related to years 
(0.2378), which implies that there is a 
significant increase in the logarithm of the 
number of passengers in Bauru airport over 
the years (2008-2014). 

V A negative value is observed for the estimator 
of the regression parameter related to the 


dollar exchange rate (-0.3248), which implies 
that there is a significant decrease in the 
logarithm of the number of passengers in 
Bauru airport with increased dollar exchange 
rate (January 01, 2008 to December 31, 2014). 
•S A positive value is observed for the estimator 
of the regression parameter related to months 
(0.02598), which implies that there is a 
significant increase in the logarithm of the 
number of passengers over the months. End of 
year leads to a significant increase in the 
number of passengers. 

From the results in Table 7, it is observed 
for the cargo case: 

V Years, months and dollar exchange rate affect 
cargo transport in Bauru airport (the 95% 
credible interval does not contain zero), that 
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is, the regression coefficients are statistically 
different from zero. 

■S A negative value is observed for the estimator 
of the regression parameter related to the 
dollar exchange rate (-0.3563), which implies 
that there is a significant decrease in the 
logarithm of cargo transport in Bauru airport 
with increased dollar exchange rate. 

■S A positive value is observed for the estimator 
of the regression parameter related to months 
(0.02059), which implies that there is a 
significant increase in the logarithm of cargo 
transport in Bauru airport over the months 
(end of year leads to increased cargo volume). 

V A positive value is observed for the estimator 
of the regression parameter related to years 
(0.1550), which implies that there is a 
significant increase in the logarithm of cargo 
transport in Bauru airport over the years. 

5.4 Presidente Prudente airport 

5.4.1 Use of a multiple linear regression model 

From the results of Table 5, we have: 

■S Years, months and dollar exchange rate affect 
the number of passengers (p-value < 0.05), 
that is, the regression coefficients are 
statistically different from zero at a 
significance level of 5%. In this airport, a 
significant effect of unemployment rate in the 
number of passengers is not observed (p-value 
> 0.05). 

V We observe a positive estimate for the 

regression parameter related to years 

(0.18048), which implies that there is a 
significant increase in the number of 
passengers in Presidente Prudente airport over 
the years (2008/2014). 

V We observe a negative estimate for the 

regression parameter related to dollar 

exchange rate (-0.27866), which implies that 
there is a significant decrease in the number of 
passengers in Presidente Prudente airport with 
increased dollar exchange rate (January 01, 
2008 to December 31, 2014). 

■S We observe a positive estimate for the 
regression parameter related to months 

(0.03393), which implies that there is a 
significant increase in the number of 

passengers over the months (end of year leads 
to increased number of passengers). 

V Approximately 80.7% of the data variability is 
explained by the model, which is an excellent 
fit. 

■S An estimator for the standard deviation of 
error is given by 0.1425. 


In Table 5, we have the least squares estimates 
for the regression coefficients, the standard errors 
(SE) of the estimators, the T-Student statistics 
values and the p-values to test if each regression 
parameter is equal to zero. From the results of 
Table 5, we have: 

V Years, months, dollar exchange rate and 
unemployment rate affect the air transport of 
cargo (p-value <0.05), that is, the regression 
coefficients are statistically different from 
zero at a significance level of 5%. 

V A positive estimate is observed for the 
regression parameter related to years (0.2322), 
which implies that there is a significant 
increase in cargo transport in Presidente 
Prudente airport over the years (2008 to 2014). 

V There is a positive estimate for the regression 
parameter related to months (0.03557), which 
implies that there is a significant increase in 
cargo transport in Presidente Prudente airport 
over the months. 

V We have a negative estimate for the regression 

parameter related to dollar exchange rate (- 
0.6919), which implies that there is a 

significant decrease in cargo transport in 
Presidente Prudente airport with increased 
dollar exchange rate (January 01, 2008 to 
December 31, 2014). 

V We have a positive estimate for the regression 

parameter related to unemployment rate 

(0.21214), which implies that there is a 
significant increase in cargo transport in 
Presidente Prudente airport with increased 
unemployment (January 01, 2008 to 

December 31, 2014). Note that the 

unemployment rate is related to the entire 
State of Sao Paulo. 

V Approximately 29.8% of the data variability is 
explained by the model. It is important to note 
that other factors not included in the model 
can also have significant effects on the 
variability of cargo transport in Presidente 
Prudente airport. 

V An estimator for the standard deviation of the 
error is given by 0.3439. Using the fitted 
models (9) and (10) and considering month = 
01 (January), year = 8 (2015), dollar exchange 
rate = 3.30 and unemployment rate = 9.50, the 
forecast value is given in the original scale by 
26.129 passengers and 15.323 in the cargo 
volume in Presidente Prudente airport. 

In Figure 7, we have the graphs of the time 
series for monthly passenger and cargo counts 
reported in the period (January 01, 2008 to 
December 31, 2014) in Presidente Prudente airport 
and the fitted values by the regression models (15) 
and (16). We observe a reasonable fit of the models 
to the data. 
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Figure 7: Time series for observed values and fitted values - passenger and cargo, Presidente Prudente. 



5.4.2 Use of a stochastic volatility (SV) model 

From the results in Table 6, some important 
observations are reported in the following 
subsections. 

■S Years, months and dollar exchange rate affect 
the number of passengers in Presidente 
Prudente airport (the 95% credible interval 
does not contain zero), that is, the regression 
coefficients are statistically different from 
zero. 

V A positive value is observed for the estimator 
of the regression parameter related to years 
(0.1445), which implies that there is a 
significant increase in the logarithm of the 
number of passengers in Presidente Prudente 
airport over the years (2008-2014). 

V A negative value is observed for the estimator 
of the regression parameter related to the 
dollar exchange rate (-0.2200), which implies 
that there is a significant decrease in the 
logarithm of the number of passengers in 
Presidente Prudente airport with increased 
dollar exchange rate (January 01, 2008 to 
December 31, 2014). 

■S A positive value is observed for the estimator 
of the regression parameter related to months 
(0.1445), which implies that there is a 
significant increase in the logarithm of the 
number of passengers in Presidente Prudente 
airport over the months (end of year leads to 
increased number of passengers). 

From the analyses of the results in Table 7, some 
important observations are reported in the 
following subsections. 

V Years, months, unemployment rate and dollar 
exchange rate affect cargo transport in 
Presidente Prudente airport (the 95% credible 
interval does not contain zero), that is, the 


regression coefficients are statistically 

different from zero. 

•S A negative value is observed for the estimator 
of the regression parameter related to the 
dollar exchange rate (-0.5819), which implies 
that there is a significant decrease in the 
logarithm of cargo transport in Presidente 
Prudente airport with increased dollar 

exchange rate. 

•S A positive value is observed for the estimator 
of the regression parameter related to months 
(0.0363), which implies that there is a 

significant increase in the logarithm of cargo 
transport in Presidente Prudente airport over 
the months (end of year leads to increased 
cargo volume). 

•S A positive value is observed for the estimator 
of the regression parameter related to years 
(0.1883), which implies that there is a 

significant increase in the logarithm of cargo 
transport in Presidente Prudente airport over 
the years. 

•S A positive value is observed for the estimator 
of the regression parameter related to 
unemployment rate (0.1567), which implies 
that there is a significant increase in the 
logarithm of cargo transport in Presidente 
Prudente airport with increased 
unemployment. 

6. Stochastic volatility model fit and the 
estimated volatilities 

From the fitted model for the dataset related to 
the four airports (Ribeirao Preto, Sao Jose do Rio 
Preto, Bauru/Arealva and Presidente Prudente), 
Figures 8, 9, 10 and 11 present the fitted means and 
the observed values for passenger and cargo 
transport in the period ranging from January 01, 
2008 to December 31, 2014. 
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Figure 8: Time series for observed values and fitted values - passengers and cargo, Ribeirao Preto. 
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Figure 9: Time series for observed values and fitted values - passengers and cargo, Sao Jose do Rio Preto. 



Figure 10: Time series for observed values and fitted values - passengers and cargo, Bauru/Arealva. 



Figure 11 :Time series for observed values and fitted values - passengers and cargo, Presidente Prudente. 



In Figure 12, we present the graphs of the square roots for the volatilities considering the dataset of the four 
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airports (monthly volume of passengers and cargo). 

Figure 12: Square roots for volatilities (passengers/cargo) of the airports. 



From the plots of Figure 12, we get some important interpretations: 


Passengers: 

V There is great volatility regarding the number of 
passengers in Ribeirao Preto airport between 
month 40 (April, 2011) and month 50 
(February, 2012). Outside this period, there is 
little volatility. 

V Similar behavior is observed for the number of 
passengers in Sao Jose do Rio Preto airport; the 
volatility increases at the end of the observed 
period, that is, close to the end of 2014 (year in 
which serious economic problems start in 
Brazil). 

V Bauru airport has great volatility regarding the 
number of passengers close to month 25 
(January, 2010) and month 55 (end of 2012). 

V Presidente Prudente airport has similar behavior 
regarding volatility of the number of 
passengers, as observed for the Bauru airport. 


emphasis on the best practices inherent to the 
concept of a hub. 

7. Concluding remarks 

From the results obtained in this study, it was 
possible to observe,with both assumed: models 
multiple linear regression model with normal errors 
or the SV model, that some periods of the year imply 
great volatilities of the air transport of passengers 
and cargo for the four studied airports in the State of 
Sao Paulo during the study period. It was also 
possible to identify important factors affecting the 
volume of passengers and cargo for these airports 
using the multiple linear regression model 
introduced in Section 3 and the SV models 
introduced in Section 4. 


Cargo: 

V Ribeirao Preto airport has seasonal behavior 
regarding volatilities in cargo transport (in 
some periods of the year there is greater 
volatility); more volatility is observed close to 
month 65, which corresponds to 2013. 

V Similar behavior is observed regarding cargo 
transport in Sao Jose do Rio Preto airport. 

V Bauru and Presidente Prudente airports have 
great volatilities regarding cargo transport, 
especially between months 10 and 35 (end of 
2008 and end of 2010) and close to month 55 
(middle of 2012). 

The volatility of the data shows the dependence 
of the effective use of air transport on the variability 
of passenger demand (tourism and corporate 
business) and cargo (potential of local production 
that requires the use of air transport for the flow of 
products), directing efforts to the economic and 
financial viability of the national air transport, with 


Although the obtained interpretations in terms 
of inference results are similar with both proposed 
models, multiple linear regression model with 
normal errors or the SV model. The SV model has 
an important advantage over the standard multiple 
linear regression model in identification of important 
factors affecting the passengers and cargo flow in 
airports in Sao Paulo state, and in the modeling fit: 
the estimation of the volatilities of flow in each 
airport. This could be an important result for airport 
managers. 

Another important result was that the fitted 
models used in this study can be of great importance 
in the prediction of the volume of passengers and 
cargo related to the economic factors considered. 
These results are of great interest to airport managers 
planning to build new large hubs for passengers or 
cargo as an alternative for the large airports located 
in Sao Paulo and Campinas. Other structures of SV 
models could also be considered to analyze the 
dataset considering autoregressive model AR(L) 
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structures larger than 2, to get better fit of the model 
for the data. In addition, it is important to point out 
that the use of a Bayesian approach with Markov 
Chain Monte Carlo (MCMC) methods is facilitated 
using free software available, such as OpenBugs, 
which gives a great simplification in the 
computational work to get the posterior summaries 
of interest. 

The use of the new stochastic volatility (SV) 
model proposed in this manuscript was very 
important in the discovery of the most important 
factors affecting the means of the time series in each 
one of the four airports located in the State of Sao 
Paulo and also in the modeling of the volatilities 
associated to the flow of passengers and the volume 
of cargo during the time period assumed in this 
study. As observed in the results obtained, some of 
the airports are more dependent on unemployment 
rates than others. It is important to point out that 
some of the airports are located in regions with more 
economic stability. For future studies, it would be 
possible to consider other volatility structures and to 
compare the results obtained with results from other 
statistical approaches, for example using generalized 
linear models to count data in the original scale and 
not in the logarithm scale, as considered in this 
study. 

It is also important to point out that, in future 
studies, the authors could apply the proposed 
statistical model for new datasets from the Brazilian 
office of the National Civil Aviation Agency 
(AN AC) linked to the major passenger and cargo 
airports in Brazil in order to identify the dependency 
level of the demand of these airports related to the 
volatility of each airport, as a criterion to plan the 
logistics project of air transport in Brazil. 
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